跳轉到主要內容

Post-Training & Forgetting

Terminology definition

LLM = foundation model ---(Post-Training = continual training = Alignment )----> Fine-tuned Model

  • Foundation Model -> can be chat/instruct model or base/Pretrain model

Methods

  1. Pre-train style
  2. SFT Style
  3. RL style

Catastrophic Forgetting

  • 這個是post training 最大的挑戰,他會忘記已有的技能.

Solutions of Catastrophic Forgetting

  1. Experience Replay
  2. Pseudo Experience Replay
  3. Paraphrase
    • 用自己的話換句話說
    • Input: new
    • Output: new --> Foundation Model (換句話說) --> Old
  4. Self-output
資訊

可能RL-based post training ( less forgetting? )